Corpus: zho-simp_news_2011_1M

2.2.5 Most frequent word beginnings

The most frequent word beginnings, given as character N-grams for N = 1 to 5, together with a Zipf diagram of their rank-frequency distribution.
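As a rough illustration of how such a list could be produced, the sketch below counts the first N characters of each word and ranks the results. It is not the tool that generated this page: the function name top_word_beginnings, the choice to count distinct word types rather than running tokens, the rule that a word must have at least N characters, and the sample word list are all assumptions made for the example.

```python
from collections import Counter


def top_word_beginnings(word_types, n, k=5):
    """Rank the k most frequent n-character word beginnings.

    Assumption: every distinct word type counts once, and words shorter
    than n characters are skipped.
    """
    counts = Counter(w[:n] for w in set(word_types) if len(w) >= n)
    # Sort by descending frequency, then alphabetically so ties are stable.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    # A trailing "-" is appended to mirror the notation used in the tables.
    return [
        (rank, freq, prefix + "-")
        for rank, (prefix, freq) in enumerate(ranked[:k], start=1)
    ]


# Hypothetical sample vocabulary, not taken from the corpus.
sample = ["Power", "PowerPoint", "Powerful", "Super", "Superman", "iPhone", "2011年"]

for n in range(1, 6):
    print(n, top_word_beginnings(sample, n, k=3))
```

Python strings are indexed by Unicode code point, so a slice such as w[:n] treats a Chinese character and a Latin letter alike as one character, which fits mixed-script entries like those in the tables.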


[Figure: Zipf's diagram for word beginnings (Gnuplot diagram)]
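A Zipf diagram plots frequency against rank on logarithmic axes; under Zipf's law the points fall close to a straight line with negative slope. The sketch below estimates that slope by ordinary least squares on the log-log values. The function name loglog_slope is hypothetical, and the input is only the five single-character frequencies listed on this page, which is far too few points for a meaningful fit; it is meant purely to show the calculation.

```python
import math


def loglog_slope(frequencies):
    """Least-squares slope of log(frequency) against log(rank).

    frequencies must be ordered by rank, starting at rank 1.
    """
    xs = [math.log(rank) for rank in range(1, len(frequencies) + 1)]
    ys = [math.log(freq) for freq in frequencies]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    return numerator / denominator


# Frequencies of the five most frequent single-character word beginnings.
top_characters = [8742, 5527, 3769, 3330, 3206]
print(f"estimated log-log slope: {loglog_slope(top_characters):.2f}")
```

A slope near -1 would correspond to the classic form of Zipf's law, in which frequency is inversely proportional to rank.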

Top Characters
rank  frequency  n-gram
1     8742       1-
2     5527       2-
3     3769       S-
4     3330       3-
5     3206       C-
Top Character Bigrams
rank  frequency  n-gram
1     1141       10-
2     1089       20-
3     1016       11-
4     852        19-
5     851        12-
Top Character Trigrams
rank  frequency  n-gram
1     321        200-
2     298        201-
3     240        600-
4     166        第11-
5     164        100-
Top Character 4-Grams
rank  frequency  n-gram
1     160        2011-
2     66         穆里尼奥-
3     65         Inte-
4     58         第110-
5     57         第111-
Top Character 5-Grams
rank  frequency  n-gram
1     44         Power-
2     42         Super-
3     42         iPhon-
4     39         Googl-
5     39         Inter-
9900 msec needed at 2018-03-31 15:31